ABSTRACT

This research presents the results of a practical
study evaluating the quality of a forecast for an
industrial product using regression analysis. A 95%
confidence level and a permissible error of 5 were
used to validate the preliminary samples of the
conductivity and density of the product. The
regression analysis was carried out with the trial
version of the specialized software Minitab®. The
results show that both the regression model and its
parameters are significant, and therefore the
regression analysis is meaningful.

Keywords: Regression analysis, model, parameters. 

INTRODUCTION 

Human beings have always sought to anticipate any 
eventuality with the aim of minimizing risks in any of 
their recreational activities as well as those of an 
economic nature (Anonymous, n.d.).

Companies behave in the same way: many of them try to
anticipate the future in order to meet the demand for
their products. Keener, cited by Gomez (n.d.), points
out that linear regression is used in business to
predict events, manage product quality and analyze a
variety of data types for decision making. Hanke and
Reitsch (1996) argue that all organizations operate in
an atmosphere of uncertainty and that, despite this,
decisions must be made that affect the future of the
organization. For organizational managers, educated
conjectures are more valuable than uneducated ones.
Thus, decision makers will do better if they
understand forecasting techniques and use them
properly, instead of being forced to plan the future
without the benefit of this valuable supplementary
information.

On the other hand, Estepa, Gea, Canadas and 
Contreras (2012) mention that among the basic 
statistical notions whose teaching must be optimized 
are those of correlation and regression. From 
prehistory to the present day, discernment about the 
possible relationship that may exist between two 
events has been an important aspect of human 
knowledge. The formation of the notions of 
correlation and regression comes, to a large extent, 
from studies in Biology, Biometry and Eugenics. The 
first author interested in the subject was Lambert
Adolphe Jacques Quetelet (1796-1874), known as
Adolphe Quetelet, born in Ghent, Belgium. He obtained
his doctorate in Mathematics with a thesis on conic
sections and became director of the astronomical
observatory of Brussels. He was a man of great
energy, enthusiasm and organizational talent, which
he used to create several international institutions
(Estepa, Gea, Canadas and Contreras, 2012).

Companies seek to keep all their processes under
control. The essential characteristics of a process
are variability (no two results are ever exactly the
same) and repeatability (the more repetitions, the
more experience) (Garcia, 2016). In the modern
business world, Pelayo (n.d.) says that the concept of
management has become established. Although this term
may be broad or unclear, we refer to management as
defined in point 3.2.6 of ISO 9000:2000: "Coordinated
activities to direct and control an organization".
This is also the concept of management used by the
National Quality Award (1999) in its rules.

Companies today use regression analysis to forecast
future data, but many SMEs are unaware that there are
mathematical models that help predict these data.
Varela and Reyes (2009) point out that the purpose of
forecasts is to predict the future development of
different projects and to assist decision making on
projection measures such as the level of investment,
production and other measures that influence, to a
greater or lesser degree, the trend of the object
under study. In our case, the measures and actions
that could be taken concern the level of preference of
the population with respect to the white goods sector.

Quality today is fundamental for ensuring conforming
products, and statistical quality control plays a
central role (Diaz, Bautista and Ortiz, 2013). For
this reason, there is a need to apply simple linear
regression analysis to the forecast of a production
process, in order to find out whether the company
understands forecasting and, in turn, whether it makes
good use of linear regression techniques. It has been
observed that the employees of the company under study
make their forecasts in an outdated way and are
unaware of the aforementioned issues.

REGRESSION ANALYSIS 

The oldest disciplines seem to have made a special
pact to contribute (almost all of them), each from its
own point of view, to the "gestation, birth and
upbringing" of units that would later be brought
together to form that
conceptual body called Statistics and Soto, 1988). 

Seal (1967) points out that Auguste Bravais
contributed to the development of this theory from
another field, astronomy, when studying the errors in
the measurements of the coordinates of celestial
bodies. It was he who first used the term correlation,
in a study presented in 1846 at the Academy of
Sciences in France. However, Pearson (1965) indicates
that Bravais, when studying the theory of errors, did
not consider correlated random variables but treated
the errors as independent of each other; he therefore
did not arrive at a true idea of correlation as we
know it today. Devore (2005) notes that the term
regression was first used as a statistical concept in
1877 by Sir Francis Galton, who conducted a study
showing that the height of children born to tall
parents tended to move back, or "regress", towards the
average height of the population. He chose the word
regression as the name of the general process of
predicting one variable (the height of the children)
from another (the height of the father or the mother).
Later, statisticians coined the term multiple
regression to describe the process by which several
variables are used to predict another.

Morales and Parra (2016) point out that, in an
environment of uncertainty where decision making is
increasingly complex, the preparation of forecasts is
a very useful tool for managers. Forecasts are
necessary to establish the general course of the
organization, both over a long period through
long-term forecasts and over a short period, designing
immediate strategies to meet future needs through
short-term forecasts.

Reyes (2009) mentions that, prior to the 1950s,
forecasting efforts were limited: although analysts
already handled some theories of linear regression and
time series decomposition, they lacked appropriate
data and faced the tedious calculations required to
obtain a forecast.

A forecast is information, with a certain degree of
probability, about what might happen. The probability
of success is a direct function of the preparation of
the forecasts. In other words, the result of the
planning and operation of the company is directly
linked to the accuracy of the forecasts (Grijalva,
2009). Everett and Ronald (1991) mention that
forecasting is the process of estimating a future
event by projecting data from the past into the
future. Past data are systematically combined in a
predetermined way to estimate the future. Zurita
(2010) argues that forecasts support decision making
in different areas of business management: sales
forecasts help design the production plan, anticipate
commodity price developments, plan supplies, and so
on. Zeissig (2010) argues that forecasts can be
estimated by two criteria: the quantitative and the
qualitative. The former analyzes historical data using
mathematical and statistical models, and the latter
uses knowledge of the current market situation and its
environment. The best production forecast will be the
one that best combines the information from both
criteria. Chapman (2006) defines forecasting as the
technique of using past experience to predict
expectations of the future.

Montgomery, Peck and Vining (2006) mention that linear
regression models are widely used in engineering,
since they serve to analyze the behavior of input (or
regressor) and output (or response) variables and to
make predictions and estimates. On the other hand,
Badii, Guillen, Cerna, Valenzuela and Landeros (2012)
indicate that regression and correlation are two
closely related techniques that constitute a form of
estimation. More specifically, correlation and
regression analysis involve the study of sample data
to determine how two or more variables are related to
one another in a population. Correlation analysis
produces a number that summarizes the degree of
correlation between two variables, and regression
analysis gives rise to a mathematical equation that
describes and predicts this relationship.
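
As an illustration of the "number that summarizes the degree of correlation", the following minimal sketch computes the sample Pearson correlation coefficient for the conductivity and density values that appear in Table 1 of this study. The function name `pearson_r` is introduced here only for illustration, and the use of the Pearson coefficient is an assumption; the cited authors do not prescribe a specific implementation.

```python
import math

# Conductivity (x) and density (y) values as listed in Table 1 of this study.
x = [0.048, 0.0525, 0.054, 0.0535, 0.057, 0.061]
y = [0.175, 0.22, 0.225, 0.226, 0.25, 0.2765]


def pearson_r(xs, ys):
    """Sample Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squares of deviations from the means.
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(xs, ys))
    s_xx = sum((a - mean_x) ** 2 for a in xs)
    s_yy = sum((b - mean_y) ** 2 for b in ys)
    return s_xy / math.sqrt(s_xx * s_yy)


r = pearson_r(x, y)
print(f"r = {r:.4f}, r^2 = {r * r:.4f}")
```

A value of r close to 1 indicates a strong positive linear association, which is the situation in which fitting a regression line is most informative.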

Lopez and Romero (2014) mention that simple or
bivariate linear regression models are used as models
of prediction or prognosis. The most typical case is
when the predictor, regressor or independent variable
X is a controlled (non-random) variable, while the
response or dependent variable Y is a random variable
that has an approximately normal distribution for each
value x of X, with constant variance $\sigma^2$.
Escalante (2013) mentions that regression analysis is
a technique used to relate, through a model, one or
more independent variables to a dependent variable
(response).

Orellana (2008) mentions that the simplest function 
for the relationship between two variables is the linear 
function: 

Y=a+bX 

Cardona, Gonzalez, Rivera and Cardenas (2013)
mention that the general equation describing the
relationship between the two variables is:

$y = \alpha + \beta x + \varepsilon$

The same authors explain that in this model y is a
linear function of x (the part $\alpha + \beta x$)
plus $\varepsilon$ (the Greek letter epsilon), which
represents the error and is a random variable.
Anderson, Sweeney and Williams (2001) point out that
the error term explains the variability in y that
cannot be explained by the linear relationship.

The method of least squares has a long history that
goes back to the beginning of the nineteenth century.
In June 1801, Zach, an astronomer whom Gauss had met
two years earlier, published the orbital positions of
the celestial body Ceres, a new "small planet"
discovered by the Italian astronomer G. Piazzi in the
same year. Unfortunately, Piazzi had only been able to
observe 9 degrees of its orbit before the body
disappeared behind the sun. Zach published several
predictions of its position, including one by Gauss
that differed remarkably from the others. When Ceres
was rediscovered by Zach in December 1801, it was
almost exactly where Gauss had predicted (Cruces,
n.d.). The method of ordinary least squares consists
of obtaining a hyperplane such that the sum of the
squares of the distances between each of the
observations of the variable and that hyperplane (the
residuals) is minimized (Chirivella, n.d.).
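
For the simple (one-predictor) case, the least squares criterion can be written out explicitly. The derivation below is a standard textbook sketch, not taken from the cited authors; the symbols a and b denote the estimated intercept and slope of the line Y = a + bX, and $(x_i, y_i)$, i = 1, ..., n, are the observations.

```latex
% Sum of squared residuals to be minimized over a and b
S(a, b) = \sum_{i=1}^{n} \left( y_i - a - b x_i \right)^2

% Setting both partial derivatives to zero gives the normal equations
\frac{\partial S}{\partial a} = -2 \sum_{i=1}^{n} \left( y_i - a - b x_i \right) = 0
\qquad
\frac{\partial S}{\partial b} = -2 \sum_{i=1}^{n} x_i \left( y_i - a - b x_i \right) = 0

% Solving them yields the least squares estimates
b = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2}
\qquad
a = \bar{y} - b \, \bar{x}
```

With a single predictor the "hyperplane" is simply a straight line, and the residuals are the vertical distances between each observation and that line.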

GENERAL OBJECTIVE 

Assess the quality of a forecast for an industrial
product using regression analysis.

Specific objectives 

> Understand the contextualization of the topic. 

> Know the existing models to evaluate the quality 
of an industrial process. 

> Calculate the representative sample using 95% 
confidence. 

> Apply the regression analysis on the 
representative samples. 

> Analyze the results obtained from the regression 
analysis on the representative samples. 

> Evaluate the results obtained from the regression 
analysis on the representative samples. 

METHODOLOGY

The study had seven stages. Figure 1 shows the
phases.



1. Conceptualization of the topic.

2. Preliminary sampling.

3. Validation of preliminary samples.

4. Calculation of the regression analysis.

5. Analysis of results.

6. Conclusions.

7. Recommendations.


Figure 1. Example of methodological steps 


1. Conceptualization of the theme. At this stage a
bibliographic review of the topic was carried out.


3. Validation of preliminary samples. At this stage
it was verified whether six preliminary samples were
sufficient for the study. Both the conductivity and
density samples were validated. The formula used was
as follows:


$$n = \left( \frac{Z \cdot \sigma}{e} \right)^2$$


Where:

n: number of samples needed for the study.

Z: value of the standard normal (Gaussian)
distribution for the chosen confidence level; for this
investigation the confidence level is 95%, which
corresponds to Z = 1.96.

$\sigma$: standard deviation of the preliminary sample.

e: error expected to be committed in the study. The
error is set to an integer value.


Importantly, if the result of the formula is smaller
than the size of the preliminary sample, the sample
is considered sufficient. Otherwise, if the result of
the formula is greater than the size of the
preliminary sample, the sample is not sufficient,
which means that more samples must be taken for the
study to be complete.


2. Preliminary sampling. In this phase the problem
proposed by Dr. Grunenfelder (2017) was taken into
account, in which 6 important samples of the process
were taken for the identification of a suitable
substitute for biodegradable materials in the fast
food packaging industry. Six samples of the
conductivity and six samples of the density were
taken.


Table 1. Example of the samples taken for the
preliminary study.

Conductivity    Density
0.048           0.175
0.0525          0.22
0.054           0.225
0.0535          0.226
0.057           0.25
0.061           0.2765


The development of the formula for conductivity
is as follows:

$$n = \left( \frac{1.96 \times 0.0043}{5} \right)^2 = \left( \frac{0.0084}{5} \right)^2 = (0.0016)^2 = 0.000002 \approx 1$$


The study found that, using 95% confidence, a
permissible error of 5 and a standard deviation of
0.0043, 0.000002 samples are required; rounding this
number up to the next integer gives one sample. This
means that a single preliminary sample would be
enough. Since the preliminary study used six samples,
these six preliminary samples are sufficient. For
density, the result of the formula was 0.00017. In the
same way, it is rounded up to the next integer, giving
a result of 1 sample. The preliminary study used six
samples, so the six samples are sufficient for the
study.
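
The validation can be reproduced with a few lines of code. The sketch below is illustrative only: the function name `required_samples` is hypothetical, and it assumes that the standard deviation is the sample standard deviation (n - 1 denominator), which the study does not state explicitly. Z = 1.96 and the permissible error of 5 are taken from the text, and the data are those of Table 1.

```python
import math
import statistics

Z_95 = 1.96  # standard normal value for 95% confidence, as stated in the text
ERROR = 5    # permissible error used in the study

# Preliminary samples from Table 1.
conductivity = [0.048, 0.0525, 0.054, 0.0535, 0.057, 0.061]
density = [0.175, 0.22, 0.225, 0.226, 0.25, 0.2765]


def required_samples(values, z=Z_95, error=ERROR):
    """Return (raw n, n rounded up) for n = (z * s / error) ** 2."""
    # Assumption: s is the sample standard deviation (n - 1 denominator).
    s = statistics.stdev(values)
    n_raw = (z * s / error) ** 2
    # At least one observation is always needed, so round up to >= 1.
    return n_raw, max(1, math.ceil(n_raw))


for name, values in (("conductivity", conductivity), ("density", density)):
    n_raw, n_needed = required_samples(values)
    verdict = "sufficient" if n_needed <= len(values) else "not sufficient"
    print(f"{name}: n = {n_raw:.7f} -> {n_needed} sample(s); "
          f"{len(values)} preliminary samples are {verdict}")
```

Because the code carries the standard deviation at full precision instead of the rounded 0.0043, the raw value for conductivity differs slightly in the last decimals from the hand calculation, but the rounded-up result is the same.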

4. Calculation of regression analysis. In this phase the trial version of the specialized software Minitab® was
used. The data were entered: conductivity (predictor variable) and density (response variable). Figure 2
shows an example of the above.


[Minitab worksheet: column C1 (X) holds the six conductivity values and column C2 (Y) the six density values of Table 1.]



Figure 2. Example of data entry to Minitab® specialized software. 


As can be seen, the previous figure shows how the data for both the conductivity variable and the density variable were
entered. Figure 3 shows an example of how the predictor variable (x) and the response variable (y) are chosen in the
trial version of the Minitab® specialized software.


[Minitab session window and Regression dialog: Response: y; Predictors: x.]


Figure 3. Example of choosing variables in Minitab® specialized software. 




5. Analysis of results. At this stage we proceeded to
examine the results obtained from the specialized
Minitab® software. Table 2 shows an example of these
results.

Table 2. Sample result of the model

Model: Y = -0.188 + 7.67 x

The above table shows the linear regression model
obtained for the process under study. With this
equation it will be possible to predict future values
of the density of the product.
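
The model in Table 2 can be checked independently of Minitab®. The following is a minimal sketch using the closed-form least squares estimates and the data of Table 1; the names `fit_line` and `predict_density`, and the example conductivity value 0.055, are hypothetical and introduced only for illustration.

```python
# Conductivity (x, predictor) and density (y, response) from Table 1.
x = [0.048, 0.0525, 0.054, 0.0535, 0.057, 0.061]
y = [0.175, 0.22, 0.225, 0.226, 0.25, 0.2765]


def fit_line(xs, ys):
    """Return (intercept, slope) of the least squares line through the data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(xs, ys))
    s_xx = sum((a - mean_x) ** 2 for a in xs)
    slope = s_xy / s_xx
    intercept = mean_y - slope * mean_x
    return intercept, slope


intercept, slope = fit_line(x, y)
print(f"Y = {intercept:.3f} + {slope:.2f} x")


def predict_density(conductivity_value):
    """Predict density from conductivity using the fitted line."""
    return intercept + slope * conductivity_value


# Hypothetical conductivity value inside the observed range.
print(f"Predicted density at x = 0.055: {predict_density(0.055):.4f}")
```

Predictions are most trustworthy for conductivity values inside the observed range (0.048 to 0.061); extrapolating beyond it assumes that the linear relationship continues to hold.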


An analysis that must be performed within the
regression analysis is the significance test, which
determines statistically whether the regression model
is worth obtaining (Escalante, 2013). Following the
ideas of the same author, the hypotheses raised are
the following:

> $H_0: \beta_1 = 0$ (There is no linear relationship
between x and y). Regression does not make
sense.

> $H_a: \beta_1 \neq 0$ (x is valuable to explain the
variation of y).

For this, the analysis of variance was used. In the same way, the trial version of the specialized software
Minitab® was used. Table 3 shows an example of the obtained results.

Table 3. Example of variance analysis result.

Analysis of Variance

Source           DF   SS          MS          F        P
Regression       1    0.0056371   0.0056371   275.84   0.000
Residual error   4    0.0000817   0.0000204
Total            5    0.0057189


The above table shows the results obtained in the
analysis of variance using Minitab® software.
Following the ideas of Escalante (2013), the important
thing is to focus on the F column. If the F calculated
in Table 3 is greater than the tabulated F, Ho is
rejected. According to Escalante's book (2013),
$F_{0.05,1,4}$ = 7.71; comparing this value with the F
of Table 3, which was 275.84, we conclude that Ho is
rejected, which means that the regression makes sense.
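
The analysis of variance in Table 3 can be reproduced from the raw data. The sketch below is illustrative and makes no claim about how Minitab® computes its output internally; the variable names are hypothetical, and the critical value 7.71 is the tabulated value quoted in the text rather than a computed quantity.

```python
# Conductivity (x) and density (y) from Table 1.
x = [0.048, 0.0525, 0.054, 0.0535, 0.057, 0.061]
y = [0.175, 0.22, 0.225, 0.226, 0.25, 0.2765]

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

# Least squares fit of y = intercept + slope * x.
s_xx = sum((xi - mean_x) ** 2 for xi in x)
s_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
slope = s_xy / s_xx
intercept = mean_y - slope * mean_x

# Decomposition of the total variation into regression and error parts.
ss_total = sum((yi - mean_y) ** 2 for yi in y)
ss_error = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
ss_regression = ss_total - ss_error

df_regression = 1        # one predictor
df_error = n - 2         # two estimated parameters
ms_regression = ss_regression / df_regression
ms_error = ss_error / df_error
f_stat = ms_regression / ms_error

F_CRITICAL = 7.71        # F(0.05; 1, 4), tabulated value quoted in the text
reject_h0 = f_stat > F_CRITICAL

print(f"SS regression = {ss_regression:.7f}, SS error = {ss_error:.7f}")
print(f"MS error = {ms_error:.7f}, F = {f_stat:.2f}")
print("Reject Ho" if reject_h0 else "Do not reject Ho")
```

Each mean square is the corresponding sum of squares divided by its degrees of freedom, and F is the ratio of the regression mean square to the error mean square.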


Continuing with the analysis to be performed within
the regression, the significance test for the model
parameters was applied. For each parameter $\beta_j$
(the constant, j = 0, and the coefficient of x,
j = 1), the following hypotheses were used:

> $H_0: \beta_j = 0$

> $H_a: \beta_j \neq 0$

For this, the analysis of variance was carried out
using the trial version of the specialized software
Minitab®. Table 4 shows an example of the obtained
results.


Table 4. Example of result of analysis of variance for model parameters.

Analysis of Variance

Predictor   Coef.      SE Coef.   T       P
Constant    -0.18796   0.02516    -7.47   0.002
X           7.6696     0.4618     16.61   0.000


The above table shows the results obtained in the
analysis of variance for the model parameters using
Minitab® software. Following the ideas of Escalante
(2013), the important thing is to focus on the P
column, which gives the p-value. If p is less than the
significance level $\alpha$, Ho is rejected. The
p-values of the parameters are as follows: 0.002 for
the constant and 0.000 for x. Both are less than the
value of $\alpha$, which is 0.05; therefore, Ho is
rejected for both parameters, which means that the
parameters of the regression model are significant.
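
The values in Table 4 can also be recomputed by hand. The sketch below is an illustration under stated assumptions: it uses the usual standard error formulas for simple linear regression, and the helper `two_sided_p_4df` (a hypothetical name) relies on the closed-form Student t distribution function that holds only for exactly 4 degrees of freedom, which is the case here because n = 6.

```python
import math

# Conductivity (x) and density (y) from Table 1.
x = [0.048, 0.0525, 0.054, 0.0535, 0.057, 0.061]
y = [0.175, 0.22, 0.225, 0.226, 0.25, 0.2765]

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n
s_xx = sum((xi - mean_x) ** 2 for xi in x)
s_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))

slope = s_xy / s_xx
intercept = mean_y - slope * mean_x

# Residual mean square with n - 2 degrees of freedom.
ss_error = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
ms_error = ss_error / (n - 2)

# Standard errors of the two estimated parameters.
se_slope = math.sqrt(ms_error / s_xx)
se_intercept = math.sqrt(ms_error * (1 / n + mean_x ** 2 / s_xx))

t_slope = slope / se_slope
t_intercept = intercept / se_intercept


def two_sided_p_4df(t):
    """Two-sided p-value for Student's t with exactly 4 degrees of freedom."""
    w = 1 + t * t / 4
    # Closed-form CDF, valid only for 4 degrees of freedom.
    cdf = 0.5 + 0.375 * (abs(t) / math.sqrt(w)) * (1 - t * t / (12 * w))
    return 2 * (1 - cdf)


ALPHA = 0.05  # significance level used in the study
p_intercept = two_sided_p_4df(t_intercept)
p_slope = two_sided_p_4df(t_slope)

for name, coef, se, t, p in (
    ("Constant", intercept, se_intercept, t_intercept, p_intercept),
    ("X", slope, se_slope, t_slope, p_slope),
):
    decision = "reject Ho" if p < ALPHA else "do not reject Ho"
    print(f"{name}: coef = {coef:.5f}, SE = {se:.5f}, "
          f"T = {t:.2f}, P = {p:.3f} -> {decision}")
```

For a different sample size the closed form no longer applies, and the p-value would have to come from a t table or a statistics library.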

6. Conclusions. After analyzing the data obtained
from the preliminary samples, both general and
specific conclusions were drawn; these are presented
in the conclusions section.


7. Design of proposals for action. Based on the
analysis of the results and the conclusions obtained,
lines of action are proposed to improve the quality of
the process.

CONCLUSIONS 

The simple linear regression forecast is an optimal
model for trend patterns (increasing or decreasing),
that is, patterns that show a linear relationship
between demand and time (Salazar, 2016).

The present research achieved its general objective:
it was possible to evaluate the quality of the process
of an industrial product using regression analysis.
Using 95% confidence and a permissible error of 5, it
is inferred that the regression model and its
parameters make sense; in other words, they are
reliable.